C:/Users/Vassilis/Documents/research/Paper Submissions/Conferences/ICASSP-2011-clustering/revision/robust_clustering_revised.dvi

Authors

  • Pedro A. Forero
  • Vassilis Kekatos
  • Georgios B. Giannakis
Abstract

Clustering is a basic task in a variety of machine learning applications. Partitioning a set of input vectors into compact, well-separated subsets can be severely affected by the presence of model-incompatible inputs called outliers. The present paper develops robust clustering algorithms for jointly partitioning the data and identifying the outliers. The novel approach relies on translating scarcity of outliers to sparsity in a judiciously defined domain, to robustify three widely used clustering schemes: hard K-means, fuzzy K-means, and probabilistic clustering. Cluster centers and assignments are iteratively updated in closed form. The developed outlier-aware algorithms are guaranteed to converge, while their computational complexity is of the same order as their outlier-agnostic counterparts. Preliminary simulations validate the analytical claims.
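
The abstract does not state the optimization problem, so the following is only a minimal sketch of one way the outlier-sparsity idea could look for hard K-means. It assumes each input x_n gets its own outlier vector o_n, and that the usual K-means cost is augmented with a group-sparsity penalty lam * sum_n ||o_n||_2, so that most o_n are driven exactly to zero. The names `robust_kmeans`, `shrink`, and `lam`, the penalty form, and the update order are illustrative assumptions, not taken from the paper.

```python
import math

def sqdist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))

def shrink(r, lam):
    """Closed-form minimizer of ||r - o||^2 + lam * ||o||_2 over o
    (vector soft-thresholding): o = r * max(0, 1 - lam / (2 * ||r||))."""
    norm = math.sqrt(sum(v * v for v in r))
    scale = max(0.0, 1.0 - lam / (2.0 * norm)) if norm > 0 else 0.0
    return [scale * v for v in r]

def robust_kmeans(points, init_centers, lam, n_iter=50):
    """Block coordinate descent on the ASSUMED cost
    sum_n ||x_n - m_{c(n)} - o_n||^2 + lam * sum_n ||o_n||_2."""
    dim = len(points[0])
    centers = [list(c) for c in init_centers]
    outliers = [[0.0] * dim for _ in points]
    labels = [0] * len(points)
    for _ in range(n_iter):
        # 1) Assign each outlier-compensated input to its nearest center.
        for n, (x, o) in enumerate(zip(points, outliers)):
            z = [xi - oi for xi, oi in zip(x, o)]
            labels[n] = min(range(len(centers)),
                            key=lambda c: sqdist(z, centers[c]))
        # 2) Update outlier vectors by thresholding the residuals.
        for n, x in enumerate(points):
            r = [xi - mi for xi, mi in zip(x, centers[labels[n]])]
            outliers[n] = shrink(r, lam)
        # 3) Update each center as the mean of its compensated members.
        for c in range(len(centers)):
            members = [n for n in range(len(points)) if labels[n] == c]
            if members:  # keep the old center if a cluster is empty
                centers[c] = [
                    sum(points[n][d] - outliers[n][d] for n in members)
                    / len(members)
                    for d in range(dim)
                ]
    return centers, labels, outliers

# Two tight clusters plus one far-away input (toy data, made up here).
points = [(0.0, 0.0), (0.2, 0.0), (0.0, 0.2),
          (5.0, 5.0), (5.2, 5.0), (5.0, 5.2),
          (20.0, -20.0)]
centers, labels, outliers = robust_kmeans(
    points, init_centers=[(0.0, 0.0), (5.0, 5.0)], lam=4.0)
flagged = [n for n, o in enumerate(outliers) if any(v != 0 for v in o)]
print("labels:", labels)
print("inputs flagged as outliers:", flagged)
```

Under these assumptions, an input is flagged as an outlier exactly when its residual norm exceeds lam / 2, so `lam` trades off how many inputs are declared outliers against how tightly the remaining inputs must fit their centers. Each of the three steps has a closed form and costs the same order of work as a plain K-means pass.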

Similar resources

Reporting quality of submissions to the National Conferences on Electronic Learning in Medical Education: implications from Iranian research performance

Background: Reporting quality of research on medical education has come under scrutiny in recent years in the wake of empirical evidence. Poor reporting quality of published abstracts may distract readers from careful reading of research evidence or, in the worst case, mislead scientists. The main objective of this study was to evaluate the extent and quality of the submitted abstracts to the 3rd and 4th N...

Towards a SIGOPS Policy on Subsequent Publications

The Problem: Often, during the course of a research project, papers get published on similar topics, perhaps with overlapping content and contributions. For example, a research group might publish a 5-page workshop paper (e.g. at HotOS), later produce an extended version of this paper for a conference (like SOSP), and then produce a revised version for a journal (like TOCS); another version of t...

Storing and Querying Multiversion XML Documents using Durable Node Numbers

Managing multiple versions of XML documents represents an important problem for many traditional applications, such as software configuration control, as well as new ones, such as link permanence of web documents. Research on managing multiversion XML documents seeks to provide efficient and robust techniques for storing, retrieving and querying such documents. In this paper, we present a novel...

ODBASE 2013 PC Co-Chairs Message

We are happy to present the papers of the 10th International Conference on Ontologies, DataBases, and Applications of Semantics, ODBASE 2011, held in Heraklion, Crete (Greece), in October 2011. The ODBASE conference series provides a forum for research on the use of ontologies and data semantics in novel applications, and continues to draw a highly diverse body of researchers and practitioners b...

Softerware: Replace SAS Programs with XML Documents to Help People and Computers Be Happier with Each Other

In a novel use of electronic documents to replace labor-intensive programming, a SAS-based integration has been implemented that uses XML to automate the creation, revision, and reuse of the publication-quality statistical tables prominent in regulatory submissions. Table content and style revision is XML document-based, not programming-based. The XML documents that define style and content can...

Publication date: 2011